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Audio processing arrangement with multiple sources 



The present invention is related to an audio processing arrangement comprising 
a plurality of audio sources generating input audio signals, processing means for deriving 
processed audio signals from the input audio signals, the audio processing arrangement 
comprising combining means for deriving a combined audio signal from the processed audio 
signals. 

The present invention is also related to an audio signal processing arrangement 
and to an audio processing method. 

An audio processing according to the invention is known from the article "A 
Signal Subspace Tracking Algorithm for Microphone Array Processing of Speech" byS. Affes 
and Y. Grenier in IEEE Transactions on Speech and Audio Processing, Vol. 5, No. 5, 
September 1997. 

In current and future communication systems like mobile telephony, video 
conferencing and Internet (TCP/IP) based communication, hands free operation becomes of 
increasing importance. Also in user interfaces that use speech recognition hands free operation 
plays an important role. 

One acoustic phenomenon which degrades^speech ^mtej^giWHty is reverberation 
due to the multipath propagation from the speaker to the microphone. This multipath 
propagation is caused by reflection of the speech signals against surrounding of the speaker, 
such as walls, furniture etc. In order to deal with this multipath propagation often a so-called 
Delay-Sum beamformer is used. In a Delay-Sum beamformer signals from a plurality of 
microphones are subjected to a delay value in order to compensate the delay differences 
between the speaker and the respective microphones. The delayed signals are combined by 
adding them. If the delay compensation works perfectly, the direct field components of the 
delay compensated audio signals will add coherently, while the reverberant speech 
components add incoherently. This will result in an increase of the speech intelligibility. 

A problem with the Delay-Sum beamformer is that it is very difficult to 
determine the delay values accurately and fast enough to track a moving speaker or to adapt to 
another person who starts to speak. This is in particular the case in reverberant rooms. As a 
result, the delay estimates may be wrong and the microphone signals are no longer added 
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coherently. Consequently, no improvement of the intelligibility of the speech signal is 



In the above mentioned article a method is described for improving the 
performance of the intelligibility of the speech signal. In said article use is made of an energy 
transfer function from the speaker to the microphones under the assumption that this energy 
transfer function will not change significantly if the speaker moves. The above mentioned 
energy transfer function has to be determined by measurements. Requiring measurements for . 
each site, makes the deployment of products using this method quite cumbersome. 

The object of the present invention is to provide an audio processing 
arrangement in which no measurements have to be performed before deployment of the audio 
processing arrangement. 

To achieve this objective the audio processing arrangement according to the 
invention is characterized in that the audio processing arrangement comprises control means 
for controlling the processing means in order to maximize a power measure of the combined 
audio signal, and in that the control means are arranged for limiting a combined power gain 
measure of the processed audio signals to a predetermined value. 

By maximizing a power measure of the combined audio signal under the 
constraint that a combined power gain measure (e.g. the sum of the power of the individual 
signals) is limited to a predetermined value, no use of measured data has to be made. 
Experiments have shown that the intelligibility of the speech signal is not deteriorated with 
respect to the prior art arrangement. 

Experiments have also shown that in the prior art arrangement so-called pre- 
echoes occur when filters having a long impulse response are used. Pre-echoes occur when 
before the reproduction of the direct field component of the speech signal, a scaled version 
thereof is reproduced. The occurrence of pre-echoes is regarded as quite annoying by a 
listener. Experiments have also shown that in the processing arrangement according to the 
invention the occurrence of pre-echoes is substantially less than in the processing arrangement 
according to the prior art. 

An embodiment of the invention is characterized in that the processing means 
comprise scaling means for scaling the input audio signals with a scaling factor for obtaining 
the processed audio signal, said control means comprise further scaling means for deriving a 
plurality of scaled combined audio signals with a scaling factor corresponding to the scaling 
factor of the scaling means, and in that the control means are arranged for maximizing a power 
measure of the combined audio signal, and for limiting a combined power gain measure of the 



obtained. It may even happen that the speech intelligibility degrades. 
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processed audio signals by minimizing a difference between the input audio signals and the 
scaled combined audio signals corresponding to said audio signals. 

Experiments have shown that using a simple scaling factor as processing means 
a very substantial improvement of the intelligibility can be obtained. A suitable constraint is 
5 now that the sum of squares of the scaling factors for the different input sources is equal to a 
predetermined constant. 

A further embodiment of the present invention is characterized in that the 
processing means comprise a plurality of adjustable filters for deriving the processed audio 
signal, in that the control means comprise a plurality of further adjustable filters having a 
10 transfer function being the conjugate of the transfer function of the adjustable filters, said 
further adjustable filters being arranged for deriving from the combined audio signal filtered 
combined audio signals, and in that the control means are arranged for maximizing the power 
Q measure of the combined audio signal, and for restricting a combined power gain measure of 

fT the processed audio signals to a predetermined value by controlling the transfer functions of 

£ 1 5 the adjustable filters and the further adjustable filters in order to minimize a difference 
Q measure between the input audio signals and the filtered combined audio signal corresponding 

j= to said input audio signals. 

5 By using adjustable filters as processing means the quality of the speech signal 

M can be further enhanced. By minimizing a difference measure between the input audio signal 

^ 20 and the corresponding filtered combined audio signal, it is obtained that a power measure of 
*j=j the combined audio signal is maximized under the constraint that per frequency component the 

sum of the power gains of the adjustable filters is equal to a predetermined constant. The 
correspondence between the two criteria mentioned above will be shown in the detailed 
description of the drawings by using a simplified example. 
25 The use of adjustable filters makes that no adjustable delay elements such as 

used in a sum-delay beamformer are required. 

A further embodiment of the invention is characterized in that the audio sources 
comprise a plurality of microphones, and in that the microphones are placed in a position such 
that their directionality patterns are substantially disjunct. 
30 By combining a plurality of microphones having disjunct directionality patterns, with 

the combining arrangement according to the invention it is obtained that the signal from the 
microphone receiving the strongest speech signal is emphasized automatically. Such a system 
can be advantageously be used in a conference system in which the sound produced by a 
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speaking person has to be emphasized, without needing a switch which is able to select the 
microphone with the strongest signal. 

A still further embodiment of the invention is characterized in that the audio 
sources comprise a plurality of microphones being placed in a linear array. 
5 Experiments have shown when a linear array of microphones is used as audio 

source in combination with adjustable filters in the processing means, the speech signals and 
their first reflections are added coherently, resulting in an improvement of the speech 
intelligibility. This configuration turned out to be more robust and showed a much faster 
convergence than configuration using a sum-delay beamformer. It is observed that in the linear 
10 array the microphones are placed on a line substantially orthogonal to the direction of the main 
lobe of the directionality pattern, but that it is also possible that the microphones are placed on 
a line coinciding with the direction of the main lobe of the directionality pattern. 
G The invention will now be explained with reference to the drawings. Herein 

lT shows: 

^15 Fig. 1 an audio processing arrangement according to the invention in which real 

C valued weighting factors are used in the processing means; 

jg Fig. 2 an audio processing arrangement according to the invention in which 

; frequency domain adaptive and frequency domain programmable filters are used; 

M Fig. 3, a detailed embodiment of the normalization means 73 used in the 

[f- : 20 arrangement according to Fig. 2. 

tfj Fig. 4 an implementation of the frequency domain adaptive filters 62, 66 and 68 

used in Fig. 2; 

Fig. 5 an implementation of the frequency domain programmable filters 44, 46 
and 50 used in Fig. 2; 

25 Fig. 6 an implementation of the audio processing arrangement according to the 

invention in which time domain adaptive filters and time domain programmable filters are 
used. 

In the audio processing arrangement 2 according to Fig. 1, an output of a first 
audio source, being here a microphone 4, is connected to a first input of the audio processing 
30 arrangement 2 and an output of a second audio source, being here a microphone 6, is 

connected to a second input of the audio processing arrangement 2. If it is assumed that the 
microphones 4 and 6 receive a signal Vjn via attenuation factors a and b, the output signal of 
microphone 4 is equal to a-Vi>j and the output signal of microphone 6 is equal to b-V IN . The 
processing means comprise here first scaling means 10 and second scaling means 12 which 
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scale their input signals with a scaling factor x respectively y. At the output of the processing 
means 1 1 the processed signals Vp and Vq are available. For these processed signals can be 
written: 

V P =a-x-V IN (A) 

and 

V Q =by-V IN (B) 

At the output of the combination means 1 8 the sum Vsum of the processed signals Vp and Vq 
is available. This signal Vsum is equal to: 

V S UM=(a-x + b-y)v IN (C) 

The further scaling means 14 and 16 derive scaled combined signals from the 
combined signal using scaling factors x and y. The first scaled combined signal is equal to 

V S ci =(a-x + b-y)-x-V IN (D) 

and the second scaled combined signal is equal to: 

V SC2 =(a-x + b-y)-y-V IN (E) 

A first difference measure between the first input audio signal and the first scaled combined 
audio signal is determined by a subtractor 24. For the output signal of the subtractor 24 can be 
written: 

V DIFF1 ^{a-Ca-x + b-yVxI-VjN (F) 

A second difference measure between the second input audio signal and the second scaled 
combined audio signal is determined by a subtractor 26. For the output signal of the subtractor 
26 can be written: 

V DIFF2 = {b-(a-x + b-y)-y}-V IN (G) 

The arrangement according to Fig. 1 comprises a control element 20 for adjusting the scaling 
factor x to make the output signal of V DIFFI of the subtractor 24 equal to 0. The arrangement 
further comprises a control element 22 to make the output signal V D jff2 of the subtractor 26 
equal to 0. In order to find the values for x and y to make both difference signals equal to 0, 
the following set of equations has to be solved: 

(a • x + b • y) • x = a (H ) 
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(ax + by)y = b (j) 

Eliminating the term (a ■ x + b • y) from (8) and (9) by dividing (8) by (9) results in: 

x a a-y ( i\ 

- = - => x = — 3 - ( < J > 
y b b 

Substituting (10) in (9) gives the following expression in y: 

'2 ^ 



- + b-y 



±b (K) 

y = b => y = 



Substituting (11) into (10) gives for x: 

±a 



x = 



(L) 



o 

~g From (11) and (12) it is clear that the value of x increases when a increases (or b decreases) 

*Z 5 and that the value of y increases when b increases (or a decreases). In such a way the strongest 

On input signal is pronounced. This is of use to enhance a speech signal of a speaker over 

g.,,j 

m background noise and reverberant components of the speech signal without needing to know 

^ the frequency dependence of the path a and b from the speaker to the microphones as was 

M needed in the prior art arrangement. An estimate of the values a and b can be derived from an 

EssS 

Ll 1 0 average level of the input signals of the microphone. 

« Below will be demonstrated that maximizing the power of the combined audio 

S3 signal under the constraint that the sum of the power gains of the processing means is limited, 

results in the same values for x and y as making the output signals of the subtractors 24 and 26 

equal to 0. 

1 5 For the power measure Psum of the combined audio signal Vsum can be 

written: 

P S UM= V SUM 2 = (a • x + b • y) 2 • V IN 2 ( M ) 

For the boundary condition that the sum of the power gains of the scaling 
means is limited to a constant value can be stated: 

G p =x 2 +y 2 = l (N) 



20 



Consequently, the term (a • x + b • y) has to be maximized under the boundary condition 
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x 2 + y 2 - 1 = 0 . This can be done by using the well known Lagrange multiplier method. 
According to said method, the following expression has to be maximized: 

(a-x + b-y) 2 +?l-(x 2 + y 2 -1) (O) 



Differentiating (15) with respect to x and y and setting the derivatives to zero gives: 

2(ax + b : y)a + 2- A,-x = 0 ^pj 

2-(a-x + b-y)-b + 2-A,-y = 0 

By multiplying (16) with y and multiplying (17) with x and subtracting the results, yields: 

y = ^x (R) 

a 

Substituting (18) in (14) gives for x and y: 

These results correspond to (11) and (12). Consequently it is clear that controlling x and y to 
make the difference signals equal to 0 is equivalent to maximizing the power of the combined 
signal under the boundary condition that the sum of the power gains of the different branches 
of the processing means is limited to a maximum value. 

The above can easily be generalized for N input signals each having a transfer 
factor aj with 1< i < N. If it assumed that the processing means have N branches each 
corresponding to a signal i and having a transfer factor x; , for these values of Xj can be 
written: 

x;=-^L= (T) 




The arrangement according to Fig. 1 can be combined with delay elements to 
compensate differences in the path delays from the source of the audio signal and the several 
microphones. The arrangement according to the invention gives an improved performance, 
also during transition periods in which the delay value of the delay elements to compensate the 
path delays are not yet adjusted to their optimum value. 
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In the audio processing arrangement according to Fig. 2, input signals from 
audio sources being here microphones 30, 32 and 34 are converted into digital signals which 
are converted into block of L samples by respective series to parallel converters 36, 38 and 40. 
The output of the series to parallel converters 36, 38 and 40 are connected to corresponding 
5 inputs of the processing means 41, and to input of respective block delay elements 54, 56 and 
58. 

In the processing means 41 the output signal of the series to parallel converter 
36 is applied to a block concatenation unit 42. The block concatenating unit 42 constructs 
blocks of N+L samples from the present block of L samples and N samples from previous 
10 blocks of samples available at the output of the series to parallel converter 36. The output of 
the block concatenation unit 42 is connected to an input of a frequency domain programmable 
filter 44. The output of the frequency domain programmable filter 44, carrying a processed 
S audio signal, is connected to a first input of the combining means being here an adder 76. The 
S frequency domain programmable filter 44 presents blocks of N+L samples at its output. 
~ 1 5 In the same way the output signal of the series to parallel converter 38 is 

Q processed by a block concatenating unit 48 and a frequency domain programmable filter 46 
j= and the output signal of the series to parallel converter 40 is processed by a block 
5 concatenating unit 52 and a frequency domain programmable filter 50. Outputs of the 

M frequency domain programmable filters 46 and 50, carrying processed audio signals, are 

20 connected to corresponding inputs of the adder 76. 

jj| The output of the adder 76 is connected to an input of an IFFT unit 77 which 

Dj 

determines an Inverse Fast Fourier Transformed signal from the output signal of the adder 76. 
The output of the IFFT unit 77 is connected to an input of a unit 79 which discards N samples 
of the N+L samples at the output of the IFFT unit 77. 

25 The output signal of the unit 77 is converted into a serial stream of samples by 

the parallel to series converter 78. At the output of the parallel to series converter 78 the output 
signal of the audio processing arrangement is available. The output signal of the unit 79 is also 
applied to a block concatenating unit 74 which derives blocks of N+L samples from the 
present block of L samples at the output of the adder 76 and a block of N previous samples at 

30 the output of the unit 79. The output of the block concatenating unit 74 is connected to an 
input of an Fast Fourier Transformer 72 which calculates a N+L points FFT from the N+L 
samples at its input. The output signal of the Fast Fourier Transformer 72 represents the 
frequency spectrum of the combined signal. This frequency spectrum is applied to inputs of 
frequency domain adaptive filters 62, 66 and 68, and to an input of a normalizer 73. An output 
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of the normalizer 73 is connected to inputs of the frequency domain adaptive filters 62, 66 and 
68. 

The output of the block delay element 54 is connected to a first input of a 
subtractor 60. The output of the block delay element 56 is connected to a first input of a 
5 subtractor 64 and the output of the block delay element 58 is connected to a first input of a 
subtractor 70. The block delay elements 54, 56 and 58 are present to compensate the delay to 
which the audio signals are subjected in the frequency domain programmable filters 44, 46 and 
50 and in the frequency domain adaptive filters 62, 66 and 68. 

An output of the frequency domain adaptive filter 62 is connected to a second 
10 input of the subtractor 60 and the output of the subtractor 60 is connected to a control input of 
the frequency domain adaptive filter. An output of the frequency domain adaptive filter 66 is 
connected to a second input of the subtractor 64 and the output of the subtractor 64 is 
O connected to a control input of the frequency domain adaptive filter. An output of the 

r: frequency domain adaptive filter 68 is connected to a second input of the subtractor 70 and the 

1 5 output of the subtractor 70 is connected to a control input of the frequency domain adaptive 
O filter. 

% The frequency domain adaptive filters 62, 66 and 68 are arranged to adjust their 

9 transfer function in order to minimize the power of the input signal at their control inputs. The 

H 

M frequency domain adaptive filters 62, 66 and 68 provide their N+L filter coefficients to the 

fJJ 20 frequency domain programmable filters 44, 46 and 48. These frequency domain adaptive 
^ filters determine the conjugate value of the N+L filter coefficients before using them to filter 

the signals received from the block concatenating units 42, 48 and 52. 

In the frequency domain adaptive filters 62, 66 and 68 according to Fig. 3, a 
padding element 80 combines the L samples available at the control input of the respective 
25 frequency domain adaptive filter with N samples having a value of 0 to a block of data having 
N+L samples. This block of N+L samples is subjected to a N+L points Fast Fourier Transform 
executed by a FFT element 82. The extension of blocks of L samples to blocks of N+L 
samples before executing the FFT is done to prevent distortion of the signal due to the 
symmetry of the FFT signal around half the sampling frequency. This measure is well known 
30 to those skilled in the art of frequency domain (adaptive) filters. 

At the output of the FFT element 82 the frequency spectrum of the signal at the 
control input of the frequency domain adaptive filter(= the output of the subtractor 60, 64 and 
70 respectively) is available. The output signal of the FFT element 82 is multiplied with the 
output signal of the normalizer 73. The N+L components of the output signal of the 
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normalizer 73 represents adaptation speed values determining the speed of adaptation of the 
coefficients of the frequency domain adaptive filter. 

The output signal of the multiplier 84 is added to the output signal of the block 
delay element 112. The output signal of the block delay element 112 represents the previous 
5 values of the filter coefficients of the frequency domain adaptive filter. The output signal of 
the adder 86 is subjected to an Inverse Fast Fourier Transform executed by an IFFT element 
94. From the 2-L output samples of the IFFT element 94, the value of the final L block is set to 
zero by the element 96. Subsequently the 2-L samples (of which L samples are zero) are 
subjected to an FFT operation executed by an FFT element 110. The combination of the IFFT 
10 element 94, the element 96 and the FFT element 1 10 is present in order to avoid signal 

distortion due to the cyclic character of the FFT transform performed by the FFT processor 82. 

At the output of the FFT element 110 N+L coefficients are available for use in 
& the filter operation. These coefficients are also passed to the corresponding programmable 

}S filter. The filter coefficients are also passed to the output of the adder 86 via a block delay 

In 1 5 element 1 12. The combination of the adder 86 , the IFFT element 94, the element 96, the FFT 
t element 110 and the block delay element 1 12 determine the filter coefficient according to the 

0] following expression. 

s v i,k = v i,k-l ' E i,k (U) 

f? In (21) Vj k represents the N+L filter coefficients at instant k, v^.i represents the N+L filter 

*5 coefficients at instant k-1 , ^ represents the adaptation coefficients provided by the 

E 20 normalizer 73 to the second input of the multiplier 84 and E^j represents the frequency 
spectrum of the error signal at the output of the subtractor 60, 64 or 70 in Fig. 2. 

In the normalizer 73 according to Fig. 4, the input signal provided by the FFT 
unit 72 in Fig. 2 a conjugating element 106 determines the conjugate value of said input 
signal. This conjugate value is multiplied with said input signal by a multiplier 104. At the 
25 output of the multiplier 104 the power spectrum of the input signal is available. The output of 
the multiplier 104 is connected to an input of a multiplier 102. 

A low pass filter constituted by the multiplier 102, an adder 100, a multiplier 98 
and a block delay element 92 determines a time average of the power spectrum of the input 
signal of the frequency domain adaptive filter as available at the output of the multiplier 104. 
30 A suitable value for b is: 
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(V) 



b = l- 



I sample 

In (22) fsample is the sample frequency with which the audio signals are sampled 
and processed. A value of 32 for L has proven to be a useful value. The output of the adder 
100 carrying the time averaged power spectrum is connected to a first input of a divider 88. 
The output signal of the conjugating element 106 is scaled with a scaling factor 2a by a scaling 
element 90. A suitable value for a is 0.01 . The output signal of the scaling element 90 is 
connected to a second input of the divider 88. 

The divider 88 determines the values of by calculating the ratio of the 
conjugated FFT transform (scaled with scaling factor 2a) of the input signal of the digital filter 
and the time averaged power spectrum of the input signal of the normalizer 73 . The value of 
X u k increases proportional to the ratio between the k th component of the spectrum of the input 
signal and the k th component of the time averaged power spectrum. This results an adaptation 
speech which is the same for all frequency components irrespective of their strength. 

In the frequency domain programmable filter 44, 46 and 50 according to Fig. 5, 
the input signal is applied to the input of an FFT element 120 which calculates a N+L points 
FFT from said input signal. A conjugating element 122 determines the conjugate value of the 
parameters received from the frequency domain adaptive filters 62, 66, 68. A multiplier 124 
calculates a filtered signal by multiplying the FFT of the input signal with the conjugated filter 
coefficients received from the frequency domain adaptive filters. 

An IFFT element 126 calculates a time domain output signal from the filtered 
output signal available at the output of the multiplier 124. A discarding element discards the L 
last samples from the output signal of the IFFT element 126 and presents at its output the 
output signal of the frequency domain programmable filter. 

It is observed that a suitable choice for N is making it equal to L, but it is also 
possible to choose N smaller or larger than L. It is desirable to make N+L equal to a power of 
two in order to enable an easy implementation of the FFT and IFFT operations. 

In the time domain implementation of the audio processing arrangement 
according to Fig. 6 the outputs of microphones 30, 32 and 34 are connected to inputs of 
processing means 131 and to delay elements 186, 188 and 190. The processing means 131 
comprise time domain programmable filters 133, 135 and 137. 

The time domain programmable filter 133 comprises a plurality of cascaded 
delay elements 130, 132 and 134, and an adder 146 which adds the output signals of the delay 
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elements weighted with a weighting factor W^i * • • • W 1)N . The weighting is performed by 
the weighting elements 136, 138, 140, 142 and 144. The time domain programmable filter 135 
comprises a plurality of cascaded delay elements 148, 150 and 152, and an adder 164 which 
adds the output signals of the delay elements weighted with a weighting factor 
5 W2,i ' * * * W2,n- The weighting is performed by the weighting elements 154, 156, 158, 160 
and 162. The time domain programmable filter 137 comprises a plurality of cascaded delay 
elements 166, 168 and 170, and an adder 182 which adds the output signals of the delay 
elements weighted with a weighting factor Wm,i * * * * W^n. 

The outputs of the time domain programmable filters 133, 135 and 137, 
10 carrying the processed audio signals, are connected to the combination means being here an 

adder 184. At the output of the adder 184 the enhanced audio signal is available. The output of 
the adder 184 is connected to inputs of time domain adaptive filters 191, 193 and 195. 
y The time domain adaptive filter 191 comprises a plurality of delay elements 

M 194, 196 and 198. The output signals of the delay elements 194, 196 and 198 are weighted 

CH 15 with weighting factors Wjj • • • ■ W 1)N by weighting elements 200, 202, 204, 206 and 208. 

jjj The output signals of the weighting elements 200 208 are added by an adder 192 which 

^ provides the output signal of the adaptive filter 191 . 

M The time domain adaptive filter 193 comprises a plurality of delay elements 

C 226, 228 and 230. The output signals of the delay elements 226, 228 and 230 are weighted 

* 20 with weighting factors W 2J W 2 , N by weighting elements 216, 218, 220, 222 and 224. 

W The output signals of the weighting elements 216 224 are added by an adder 210 which 

provides the output signal of the adaptive filter 193. 

The time domain adaptive filter 195 comprises a plurality of delay elements 
236, 240 and 246. The output signals of the delay elements 236, 240 and 246 are weighted 

25 with weighting factors W Mj i W m ,n by weighting elements 234, 238, 242, 244 and 248. 

The output signals of the weighting elements 234 248 are added by an adder 232 which 

provides the output signal of the time domain adaptive filter 195. 

The outputs of the delay elements 186, 188 and 190 are connected to first inputs 
of subtractors 212, 214 and 250. The delay elements 186, 188 and 190 are present to make the 
30 impulse response of the programmable filters relatively anti-causal (earlier in time) with 

respect to the impulse response of the time domain programmable filters. Second inputs of the 
subtractors 212, 214 and 250 are coupled to outputs of the time domain adaptive filters 191, 
193 and 195. The outputs of the subtractors 212, 214 and 250 are connected to control means 
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231, 233 and 235 respectively. The control means are arranged to adjust the transfer function 
of the corresponding adaptive filter 191, 193 and 195 in order to minimize the power of the 
output signal of the corresponding subtfactor. 

The control means 23 1 , 233 and 235 are arranged for adjusting the coefficients 
5 of the adaptive filters 191, 193 and 195 according to the following expression: 

W j k (n + 1) = W jjk (n) + // • y[n - k] • e j[n] ( w ) 

In (23) Wj ? k(n) is the weight factor of the k th weighting element in the j th adaptive filter, ja is a 
adaptation constant and ej[n] is the difference between the output signal of the j th block delay 
element delaying the input signal and the output signal of the j th adaptive filter. yj[n-k] is the 
over k sample periods delayed output signal of the audio processing arrangement. These 

0 signals y[n-k] are available at the output of the delay elements of the adaptive filters. Because 
the adaptive filters all have the same input signals, the delay elements can be shared leading to 
a reduction of the required number of delay elements. 

After the coefficients Wj^(n) have been determined, these coefficients are 
reversely passed to the time domain programmable filters 133, 135 and 137. This means that 

5 the coefficients corresponding to the first taps in the adaptive filters are passed to coefficients 
of the last taps in the corresponding programmable filter. 
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